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We compute exactly the distribution of the occupation time in a discrete non-Markovian toy 
sequence which appears in various physical contexts such as the diffusion processes and Ising spin 
glass chains. The non-Markovian property makes the results nontrivial even for this toy sequence. 
The distribution is shown to have non-Gaussian tails characterized by a nontrivial large deviation 
function which is computed explicitly. An exact mapping of this sequence to an Ising spin glass chain 
via a gauge transformation raises an interesting new question for a generic finite sized spin glass 
model: at a given temperature, what is the distribution (over disorder) of the thermally averaged 
number of spins that are aligned to their local fields? We show that this distribution remains 
nontrivial even at infinite temperature and can be computed explicitly in few cases such as in the 
Sherrington-Kirkpatrick model with Gaussian disorder. 
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I. INTRODUCTION 

The occupation time T of a stochastic process x(t) is 
simply the time that the process spends above its mean 
value (say 0) when observed over the period [0, t], 



9[x{t')\dt', 



(1) 



where 9[x] is the Heaviside step function and we assume, 
for simplicity, that the process starts at x(0) = 0. Since 
the seminal work of Levy who computed the exact 
probability distribution of T in the case when x(t) is just 
an ordinary Brownian motion, there has been a lot of 
interest in the mathematics community to study the oc- 
cupation time for various processes [^|j3). Recently the 
study of the occupation time has seen a revival in the 
physics community in the context of noncquilibrium sys- 
tems [Qj5| due to its potential applications in a wide range 
of physical systems which include, amongst others, op- 
tical imaging ||, analysis of the morphology of growing 
surfaces |7| and analysis of the fluorescence intermittency 
emitting from colloidal semiconductor dots § . 

The occupation time T is clearly a random variable. 
Its probability distribution P(T, t) evidently depends on 
the window size t. It turns out that quite generically 
there are essentially two types of asymptotic behaviors 
of this distribution P(T, t) depending on whether the 
underlying stochastic process x(t) is non- stationary or 
stationary. A non-stationary process is one where the 
two-time correlation function C(t,t') = (x(t)x(t')) de- 
pends on both times t and t' . An example is the ordinary 
Brownian motion, dx/dt = r)(t) where rj(t) is a Gaussian 
white noise with (n(t)) = and {rj{i)r)(t')) = S(t - t'). 
In this case, C(t, t') = mm(t, if). In a stationary process, 
on the other hand, the two-time correlation function de- 
pends only on the time difference, C(t,t') = C{\t — t'\). 



A simple example of a stationary process is the Orstein- 
Uhlenbeck process, dx/dt = —Xx + rj(t)^ where a particle 
moves in a parabolic potential in presence of external 
thermal noise. In this case, the particle reaches equilib- 
rium at long times when the two-time correlation simply 
becomes, C(t,t') = exp[— A|t — t'\]. 

In the non-stationary case, one expects that in the 
asymptotic limit t — > 00, T — > 00 but keeping the ra- 
tio r = T/t fixed, the distribution P(T, t) has the generic 
scaling behavior, 



1 (T 



(2) 



where the scaling function f(r) has nonzero support only 
in the range r S [0, 1]. For example, in the case of or- 
dinary Brownian motion, the scaling function f(r) can 
be computed exactly Q, f(r) = l/ny/r(l — r). This 
is known as the Arc-Sine law of Levy since the cumu- 
lative distribution has an arc-sine form, J Q f(r')dr' — 
2sin _1 (\/r) /it. Note that for the Brownian case the scal- 
ing actually holds for all t and T. The analytical calcu- 
lation of this scaling function f(r) is, however, nontrivial 
even for this simple Brownian case. Following the work 
of Levy, there have been various generalizations of this 
Arc-Sine law. For example, the scaling function f(r) has 
been computed exactly for the so called Levy processes 
H , and recently for a more general class of renewal pro- 
cesses [|| . The occupation time distribution has also been 
studied recently for a Brownian particle moving in a ran- 
dom Sinai type potential and the corresponding scaling 
function f(r) has been computed exactly 

For stationary processes, on the other hand, the distri- 
bution P(T, t) is expected to have the following generic 
asymptotic behavior in the appropriate scaling limit T — > 
00, t — ► 00 with the ratio r = T/t fixed |10|, 



P(T,t) 



-t*(T/t) 



(3) 



1 



where $(r) is a large deviation function with, in general, 
non-Gaussian tails jn|. For example, for the Ornstein- 
Uhlenbeck stationary process discussed in the previous 
paragraph, the function $(r) has recently been computed 
exactly by utilizing a mapping to a quantum mechanical 
path integral problem p0| . 

The calculation of either the scaling function /(r) for 
non-stationary processes or the large deviation function 
$(r) for stationary processes is a challenging theoreti- 
cal problem. So far, exact results exist only for Markov 
processes where the value of the process x(t) at time t 
depends only on its value at the previous time step, say 
at t — At where At is an infinitesimal time step, but is 
completely independent on the previous history of the 
process. For example, the ordinary Brownian motion 
and the Orstein-Uhlenbeck processes are both Marko- 
vian. On the other hand, most processes in nature are 
non-Markovian and the Markov processes are more of 
exceptions rather than rules. Non-Markov processes are 
known to be notoriously difficult for the analytical calcu- 
lation of even simpler quantities such as persistence, i.e., 
the probability that the process does not change sign up 
to time t p| . Naturally the analytical calculation of 
the occupation time distribution for non-Markovian pro- 
cesses is even more difficult. 

For a certain class of 'smooth' non-Markovian pro- 
cesses such as the diffusion equation, it is possible to 
compute the occupation time distribution Q] using the 
independent interval approximation (IIA) which assumes 
that the intervals between successive zero crossings are 
statistically independent |l2). But these IIA results are 
only approximate. To our knowledge, there exists no 
exact result for the occupation time distribution for a 
non-Markov process, either stationary or non-stationary. 
In this paper, we obtain, for the first time, an exact an- 
alytical result for the occupation time distribution for a 
stationary non-Markovian process. To be more precise, 
we actually study the occupation time distribution of a 
discrete stationary non-Markovian sequence and not a 
continuous stochastic process. Nevertheless the asymp- 
totic behavior as given by Eq. (||) still remains true and 
we compute analytically the corresponding large devia- 
tion function <i>(r). 

Recently the importance of studying the statistical 
properties such as the persistence and the distribution 
of the number of zeros of discrete stochastic sequences, 
as opposed to the more traditional continuous stochastic 
processes, has been emphasized in a number of articles 
-[Tq]. There are two principal reasons for studying 
a stochastic sequence. First, in various experiments and 
numerical simulations, even though the underlying physi- 
cal process is continuous in time, in practice one actually 
measures the events only at discrete time points. The 
result of this discretization can lead to subtle and im- 
portant differences between the 'true' properties of the 
process and the 'measured' properties To estimate 



these differences, it is important to study the properties 
of a discrete sequence. The second reason follows from 
the observation jlj] that many processes in nature such 
as weather records are stationary under translations in 
time only by an integer multiple of a basic period. For 
example, the seasons repeat typically every one year. For 
such processes, it was observed in Ref. that the per- 
sistence of the underlying continuous process coincides 
with that of the discrete sequence obtained from the mea- 
surement of the process only at times that are integer 
multiples of the basic period. 

In particular, in Ref. a specific discrete sequence 
was obtained as a limiting case of the diffusion equa- 
tion on a hierarchical lattice. This rather simple toy se- 
quence, even though non-Markovian, had the remarkable 
property of being solvable for certain statistical proper- 
ties such as the persistence Jjl| and the distribution of 
the number of zeros |fl6|| . Furthermore, these exact re- 
sults were rather nontrivial Jl5|,[l6| even for this toy se- 
quence. It is always important to have a such a solvable 
non-Markovian toy model which can then be used as a 
benchmark to predict the possible expected behaviors of 
various observables in a more complex non-Markovian 
process. In this paper we show that the occupation time 
distribution can also be computed exactly for this toy 
model and like other quantities such as the persistence, 
it is rather nontrivial even for this simple toy model. 

We further make an interesting observation that the 
occupation time for this toy sequence is related to a spe- 
cific physical observable in an Ising spin glass chain with 
nearest neighbor interactions. In a given sample of the 
spin glass chain, one can ask: what is the average (ther- 
mal) number of spins that are aligned to the direction 
of their local fields? This physical object is a random 
variable that fluctuates from one sample of disorder to 
another. A natural question is: what is the probabil- 
ity distribution (over disorder) of this thermal average? 
It turns out that this distribution is nontrivial even at 
infinite temperature. In fact, we show that at infinite 
temperature this distribution in the spin glass chain co- 
incides exactly with the occupation time distribution of 
the toy sequence mentioned above. This connection is 
useful as it raises a general question for any spin glass 
model (and not just restricted to a chain): what is the 
probability distribution of the average (thermal) number 
of spins that are aligned to their local fields? In this 
paper, we show that the analytical computation of this 
distribution in the limit of infinite temperature, though 
still nontrivial, is tractable in few cases. In particular, we 
calculate analytically this infinite temperature distribu- 
tion in the Sherrington-Kirkpatrick (SK) model of mean 
field spin glasses fL7| . 

The layout of the paper is as follows. In Sec. II, we 
define the toy sequence, recall some of its properties and 
known results and then compute the occupation time dis- 
tribution exactly. In Sec. Ill, we establish the connection 
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to a spin glass chain and raise the general question re- 
garding the distribution of the average number of spins 
aligned to their local fields in a generic spin glass model. 
In Sec. IV, we compute this distribution analytically 
in the SK model at infinite temperature and show that 
it is nontrivial even at infinite temperature. Finally we 
conclude in Sec. V with a summary and some open ques- 
tions. 



II. THE TOY SEQUENCE AND ITS EXACT 
OCCUPATION TIME DISTRIBUTION 



The toy sequence we study in this section was origi- 
nally derived as a limiting case of the diffusion process 
on a hierarchical lattice M|. This is a sequence {ipi} of 
correlated random variables constructed via the following 
rule, 



ipi = 4>i + 4>% 



=1,2, 



.N, 



(4) 



where </>(i)'s are independent and identically distributed 
(i.i.d) random variables, each drawn from the same sym- 
metric continuous distribution p{(p). Note that even 
though cp(i) 's are uncorrelated, the variables ipi's are 
correlated. The two point correlation function, C^j = 
(ipiipj) can be easily computed from Eq. (^), 



(5) 



where Si.j is the Kronecker delta function and a 2 = 
J_ cj) 2 p(<p)d(p which we assume to be finite. Thus the se- 
quence {ipi} has only nearest neighbor correlation. Also 
note that for large sequence size N, the sequence is sta- 
tionary since Cij depends only on the difference \i — j\, 
and not individually on i or j. The sequence {ipi} is also 
non-Markovian. To see this, one can try to express a 
specific member of the sequence, say ipi, only in terms of 
other members of the sequence Jll| . This can be easily 
done using Eq. (0) and one gets for any i > 2, 



(6) 



k=l 



This relation clearly demonstrates the history depen- 
dence of the sequence in the sense that ipi depends not 
just only on the previous member ipi-i (as would have 
been in the Markov case), but on the whole history of 
the sequence preceding ipi. 

We now turn to the exact computation of the occupa- 
tion time distribution for the sequence in Eq. (JI]). The 
occupation time R in this case is simply the number of 
ipi's that are positive out of the total number N and is 
given by the discrete counterpart of Eq. (|l|), 



N 



(7) 



Clearly R is a random variable over the range < R < N. 
Let us denote its probability distribution by P(R, N) 
which is formally given by, 



P(R,N)= J 8 



N 



n^)*' ( 8 ) 



Analogous to the asymptotic behavior in Eq. (|3j) for 
continuous stationary processes, we will show that in the 
appropriate scaling limit R — > oo, N — > oo but keeping 
the ratio r = R/N fixed, the distribution P(R, N) has 
the scaling behavior, 



P(R, N) 



-N<S>(R/N) 



(9) 



where the large deviation function <3>(r) can be com- 
puted analytically. Note also that since p(<p) is symmetric 
around the origin, the number of positive members of the 
sequence must have the same distribution as the number 
of negative members, i.e., P(R,N) = P(N — R, N). Con- 
sequently, we must have $(r) = $(1 — r), i.e., the large 
deviation function over the allowed range < r < 1 must 
be symmetric around r = 1/2. 

To compute the distribution P(R, N) we use a transfer 
matrix method which has already been used successfully 
to calculate other quantities for this sequence such as 
the persistence ]lq| and the distribution of the number 
of sign changes p6| . To start with, we define Q^, N (4>o) 
denoting respectively the joint probability that the first 
member of the sequence ipi is positive (negative) and 
that the sequence of size N has a total R number of pos- 
itive members, given the value of <fi . Let us also define 
Qr,n(M = Qr !N (<Po) + Qfl,Ar(0o) which denotes the 
probability of having R positive members in a sequence 
of size N, given cf>Q. The required occupation time distri- 
bution is then given by, 



P(R,N) 



QR,N((po)p{4>o)d<po. 



(10) 



The reason for this small detour is simply that one can 
write quite easily a recursion relation for the joint proba- 
bilities N ((po). However it is not easy to write a recur- 
sion directly for the distribution P(R, N). The probabil- 
ities Qrn{<Po) satisfy the following recursion relations, 



Qr.n(<M 



Qr,n(M 



#ip(0i)Q«-i,iv-i(^i) 



-<t>o 



iK0i)Qfl,W<M- (ii) 



The above recursion relations are valid for all < R < N 
and N > 1 with the initial conditions Qq (<Pq) = 1 and 

QoAM = o. 
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These recursion relations in Eq. (|TT|) are easy to fol- 
low. Consider first the relation for Qr N (4>o)- In order 
for the first member ipi to be positive, it follows from the 
definition, ipi = 4>i + 4>o, that 4>i > —<fio for a given <f>o. 
This explains the integration range on the right hand 
side of the first line in Eq. (p"l|). Also once the first 
member is positive, in order to have a total R positive 
members, we need to ensure that the rest of the chain of 
size N — 1 (excluding the first member) has exactly R—l 
positive members. The probability of this latter event, 
for a given (f>x, is simply Qr.n{4>i)- This explains the 
integrand on the right hand side of Eq. (|ll]) . The second 
line of Eq. (]ll]) can be understood following a similar 
line of reasoning. Note that the recursion relations in 
Eq. ([Tl] ) also satisfy the one sided boundary conditions, 
Qft N (— oo) = and N (oo) = 0. The first condition 
follows from the fact that if <f>o — > — oo, then the first 
member of the sequence 4>i = (f>i + 4>o can be positive 
only with a vanishing probability. On the other hand if 
(fio — > oo, then -01 can be negative only with probabil- 
ity zero thus giving rise to the second condition. Note 
however that the values at the other boundaries namely 
Qrn(°°) an d Qrn(~°°) are unspecified. 

We next define the generating functions, 



(12) 



R=0 



with the understanding that Q% N {M = for R > N 
since R can take values only in the range < R < N. 
We also_define Qat(0o, y) = Q%{<f>o,y)+QN(<j>o,y)- Using 



Eq. (|1 1|) , it is easy to see that the generating functions 
satisfy the recursions, 

poo 

Qtf{^>o,y)^y d(f>ip((f>i)Q N -i(4>i,y) 



Q N (^o,y) = 



kp(<t>i)QN-i((f>i,y), (13) 



with the boundary conditions Q^(~oo,y) — and 
Qjf(oo,y) = for all y > 0. These generating func- 
tions also satisfy the condition Qo(4>o,y) — 1 for all y. 
The next step is to differentiate the recursion relations 
in Eq. (031) with respect to 0o which gives, 



yp{-<t>o)QN-i{-(i>o,y) 



dQjf((t>o,y) 



9Q N {<t>o,y) 



-p(-<fo)QN-i(-<h,v)- 



Further simplifications can be made by using the symme- 
try p(—(f>o) — p(4>o) and by making a change of variable 
from 0o to u((f>o) = p((f>)d<f>. Note that since p(cj>) 
is symmetric around the origin, 4>o — > —(f>o corresponds 
to u — * —u. Thus u(4>o) is a monotonic function of 4>o- 



Note further that as <fio — > ±oo, u — > ±1/2, where we 
have again used the fact that p{4>o) is symmetric around 
the origin. Let us also write, Q%(4>o,y) = S^(u,y) 
and Q N (4>o,y) = S N (u,y) where S N (u,y) = S^(u,y) + 
Sjf(u, y). Then the relations in Eq. ( Ill ) simplify to, 



dS^(u,y) 

du 
dS N (u,y) 

du 



yS N -i(-u,y) 
-S N -i(-u,y), 



(15) 



which are valid over —1/2 < u < 1/2. In terms of 
the variable u, the boundary conditions Q%{— oo, y) = 
and Q~^{oo,y) — translate to S^(—l/2,y) = and 
Sjj(l/2,y) = for all y > 0. Note also the interest- 
ing fact that the distribution p((f>) has completely disap- 
peared in Eq. ([l5|). The consequence of this, as we will 
see later, is that occupation time distribution P(R, N) is 
completely universal, i.e., independent of the distribution 
p(4>) as long as it is symmetric and continuous. 

The recursion relations in Eq. (|l^) , though much sim- 
plified, are still nontrivial since they are nonlocal in u. 
We next employ the technique of separation of variables, 
S N (u,y) = \~ N f ± (u) where we have suppressed the y 
dependence for convenience of notations. Substituting 
this form in Eq. (|l^) , we get a non-local eigenvalue equa- 
tion, 



df + 

j^ = lA [f + (-u) + f-(-u)) 

£: = -a [/+(-«) +/-(-«)] 



(16) 



where the eigenvalue A is yet to be determined. We 
also have the boundary conditions, / + (— 1/2) = and 
/~ (1/2) = 0. It is easy to see from Eq. ( |16| ) that the sum, 
f(u) = f + (u) + f~(u) satisfies the non-local first order 
equation, f'(u) — —u>f(—u) where lo = A(l — y). Differ- 
entiating this equation once more, we get a local second 
order equation, f"(u) — —Lu 2 f(u) whose most general so- 
lution is given by /(it) = Acos(luu) + B sin(uju) where A, 
B are arbitrary constants. One further notices that this 
general solution will also satisfy the first order non-local 
equation f'(u) = —ujf(~u) provided B = —A. Thus 
we arrive at the solution, f(u) — A [cos(wu) — sin(wit)]. 
Substituting this solution on the right hand side of the 
first line in Eq. ( |l6| ) and solving the resulting equation 
using the boundary condition f + {— 1/2) = 0, we get 

(14) f + (u) = [sin(wu) — cos(luu) + sin(cj/2) + cos(cj/2)] . 



(17) 



The other function f~(u) then follows from the relation, 
f~(u) = f(u) — f + (u) where /(w) = A[cos(uu) — sin(wu)] 
and f + {u) is given by Eq. (p7|). The function f~(u) still 
has to satisfy the boundary condition /~(l/2) = 0. In 
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fact, this condition determines the eigenvalue A and we 
get tan(u;/2) = (1 — y)/(l + y) where u> = A(l — y). For 
large N, only the smallest eigenvalue A will dominate 
which is given by 



A = 



(i-y) 



tan 



1 + y 



(18) 



Using the exact X(y) from Eq. (|18|), we are now ready 
to compute the large N behavior of the occupation time 
distribution P(R, N). In Eq. (fiol), after making a change 
of variable (f>o — + u, we find the generating function, 
J2 M P ( R > N )y R = f- 1 2 / 2 s N(u,y)du. We substitute the 
large N behavior Sn(u, y) » \~ N f(u) and carry out the 
integration using the exact expression of f(u) to obtain 
the following exact large N result, 



9 A 

J2 P(R,N)y R *— S m(u;/2)[\(y)}- N , 

M=0 



(19) 



where \(y) is given by Eq. (|18[). By inverting the 
generating function and carrying out a standard steep- 
est decent analysis for large N, large R but keeping 
the ratio r = R/N fixed, we get the desired result, 
P(R,N) ~ exp [— N$(R/N)] where the large deviation 
function $(r) is given by the exact formula 



<f>(r) = max 

y 



log 



r-tan 

(i-i/) Vi + y 



(20) 



We first note that the function Y(y,r) = 
22/ r tan" 1 [(1 - y)/(l + y)] /(l - y) inside the 'log' in Eq. 
( pp| ) is invariant under the transformation y — > 1/y and 
r — * 1 — r, i.e., F(y,r) = F(l/t/, 1 — r). This obviously 
indicates that $(r) = $(1 — r) as expected. Determin- 
ing $(r) in closed form seems difficult, though it can be 
obtained quite trivially using Mathematica, as displayed 
by the solid line in Fig. 1. 



0.5 







0.5 



FIG. 1. The large deviation function $(r) plotted against 
r. The solid line corresponds to the large deviation function 
for the 1-d sequence and is obtained from Eq. ( po| ) using 
Mathematica. The dotted line corresponds to that of the SK 
model obtained using Mathematica in Eq. ( pS5| ) after the shift 
$(r) = 6(2r - 1). 

It is however easy and instructive to obtain analyti- 
cal expressions of $(r) in the regimes near r = and 
r = 1/2. It turns out that these limits correspond re- 
spectively to y —* and y — ► 1 in the function Y(y,r). 
Keeping r fixed we expand Y(y, r) for small y and near 
y — > 1 in a Taylor series and then take the logarithm and 
maximize to obtain the following limiting behaviors, 



$M = log ( g 



r log 



6/ 1 

- r 

5 V 2 



(4 - 7r)e 
,r-»l/2 



,r -> 



(21) 



These limiting forms have interesting physical implica- 
tions. Consider first the limit r — > or equivalcntly 
0. Note that P(0,iV) = P(N,N) - exp[-$(0)iV] 
is just the probability that all the members are either 
negative or positive up to length N which is precisely 
the persistence of the sequence. The persistence for this 
sequence was earlier computed in Ref. ]l5| and it was 
found to decay for large N as exp[— ON] with the persis- 
tence exponent 9 — log(7r/2). Thus the limiting form of 
$(r) as r — > in Eq. ( ^l| ) is consistent with the persis- 
tence exponent, 9 = $(0) = log(7r/2). 

The other limit r — * 1/2 is also interesting and can 
be derived independently from a central limit theorem. 
To see this wc find from Eq. (@) that R - (R) = 
YnLii^i ~ ( x i)) where (R) = N/2, x { = 9(ipi) and 
(xi) = 1/2. In general the summands [xi — (xi)) are 
of course highly correlated and one can not employ the 
central limit theorem to evaluate the sum. However, one 
can do so in the limit when M — > (M) when the vari- 
ables (xi — (xi)) become only weakly correlated. Then 
the central limit theorem predicts a Gaussian distribution 
for the sum, P(R,N) - exp [-(R - N/2) 2 /2a 2 N ] where 
cr^f = ((R — N/2) 2 ) is the variance. One can calculate 
this variance independently by computing the correlation 
functions ((a;, — l/2)(:Ej — 1/2)) where Xi = 8(ipi)- It is 
shown in the Appendix that for large N, a 2 N = 5N/12. 
Hence the central limit theorem predicts that in the limit 
R-+1/2, P(R,N) ~ exp [-6N(r - l/2) 2 /5] thus yield- 
ing exactly the same limiting form of <&(r) for r — * 1/2 
as in Eq. @). 

Thus the occupation time distribution P(R,N), 
though Gaussian near the mean value R — N/2, becomes 
non-Gaussian as r = R/N deviates away from its mean 
and approaches the tails r — > or r — » 1. This crossover 
from Gaussian behavior near r — 1/2 to non-Gaussian 
behavior near r — > 0, 1 is characterized by the large devi- 
ation function $(r) changing from a quadratic function 
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near r = 1/2 to non-quadratic behavior near r — > 0, 1 as 
given by Eq. (|2l"|). We conclude this section by noting 
the important fact that the large deviation function <f>(r) 
in Eq. (|2(]) and in fact the full occupation time distribu- 
tion P(R,N) is completely universal, i.e., independent of 
the distribution p(<f>) (as long as p((f>) is symmetric and 
continuous). Moreover, this universality holds for any 
arbitrary N and not just asymptotically for large N. 



III. RELATION TO SPIN GLASS MODELS 

We start this section by raising a physical question for 
a general spin glass model defined on a finite lattice of 
N sites: What is the distribution (over disorder) of the 
thermally averaged number of spins that are aligned to 
their local fields? This distribution depends on the tem- 
perature and on the system size TV. It turns out that 
the distribution remains nontrivial even in the infinite 
temperature limit. In fact, for a nearest neighbor Ising 
spin glass chain, we show that this infinite temperature 
limiting distribution is precisely that of the occupation 
time distribution of the toy sequence computed in the 
previous section. The infinite temperature limit, though 
nontrivial, is tractable in few other cases such as the SK 
model of Ising spin glass which will be discussed in detail 
in the next section. 

Consider a spin glass model on a lattice of N sites de- 
fined by the Hamiltonian, 



E 



jSiSj, 



(22) 



<%,j> 



where Si's are the spin variables (not necessarily Ising) 
and Jij denotes the coupling between site i and site j. 
In the nearest neighbor model, the sum in Eq. ( |22] ) runs 
over nearest neighbor pairs. On the other hand, for long 
range mean field models such as the SK model, the sum 
runs over all pairs of sites. The variables Jij's are inde- 
pendent and each is drawn from the identical distribution 
p(J), which we assume to be symmetric and continuous. 
Henceforth we will use the short hand notation J and 
S to denote respectively the set of couplings and the 
set of spins. Thus the J's have the joint distribution 
Q J dJ = Y\i j p(Ji,j)dJij . The local field that a spin 

at site i sees is simply hi = J2j If the spin gets 

aligned to its local field, we must have hiSi > 0. Hence 



J,S 



in a given configura- 



the total number of spins N a 

tion that are aligned to their local fields can be formally 
written as, 



N a 



J,S] =J29[hiSi\ 



Si ^ J; 



1,3 



(23) 



Evidently iV a is a random variable that depends on the 
couplings J as well as the spins S. Let us first compute 
the thermal average of N a over the spin configurations 
for a fixed quenched disorder J, 



(24) 



where Z — e P E ( S ) is the partition function and /3 is 
the inverse temperature. This thermal average iV a (j\ 
is a random variable that varies from one realization of 
disorder to another. We then ask: what is the probabil- 
ity distribution of this random variable (over disorder) 
at a given inverse temperature (31 This probability dis- 
tribution Prob(iV a = R) — Pp{R,N) can be formally 
represented as, 



Pp(R,N)= / S 



R-N 



,(/) 



Q 



J 



dJ. 



(25) 



The analytical calculation of Pp (R, N) at arbitrary (3 
seems difficult. Let us, therefore, consider a simpler limit, 
namely the limit of infinite temperature or equivalently 
— > 0. In this limit, the thermal average in Eq. ( |2~i| ) 

becomes simple, iV a ^J^ = J^s N a [J, S]/Nc where Nq 
is the total number of spin configurations. Thus all spin 
configurations are equally likely. However, the distribu- 
tion Pq(R,N) as in Eq. (^5|), even in this infinite tem- 
perature limit, is still nontrivial. 

Let us now focus on Ising spins where Si = 
±1. Here Nc = 2 N where N is total number 
of lattice sites. The Eq. @, using Eq. (§|), 
then becomes simpler for the Ising case, Pq(R,N) — 

Js[R-^J2sJ2lid (Ej Ji,i SiSj)] Q [f] dJ. The 
next step is to make a gauge transformation, <f>ij — 
JijSiSj. Since the spins are Ising, i.e., S{ = ±1, (p^/s 
have the same distribution as the Jjj's. The advantage 
of this gauge transformation is that one can then do away 
with the configuration sum over the spins and we simply 
get, 



P (R, N) 



N 






R -}2 e 




Q 


$ 


i=l 







dip. (26) 



Now consider the special case of a nearest neigh- 
bor Ising spin glass chain of size N where E = 
— Ei Ji,i+iSiSi+± with free boundary conditions. Var- 
ious properties of this spin glass chain such as the statis- 
tics of the number of metastable states have been studied 
analytically by Li p8| and by Derrida and Gardner ]l9| . 
Recently it was also shown that the persistence in the toy 
sequence studied in this paper is the same as the average 
fraction of metastable spins in the Ising chain Jig ]. In 



G 



the present context, we find the distribution Pq(R, N) in 
Eq. reduces to, 



Po(R,N) = J 6 



R 



N 



Q 4> dcj>. (27) 



Comparing Eqs. ( p7| ) and flq) one immediately finds 
P Q {R,N) = P(R,N), where P(R,N) is precisely the 
occupation time distribution that was computed in Sec. 
II. This thus establishes the promised link between the 
spin glass problem discussed in this section and the non- 
Markovian toy sequence discussed in Sec. II. The infinite 
temperature distribution (over disorder) of the thermally 
averaged number of locally aligned spins is identical to 
that of the occupation time distribution of the toy se- 
quence discussed in Sec. II. From the exact results of 
P(R, N) derived in Sec. II, one therefore knows the dis- 
tribution Pq(R,N) exactly as well. 

A question naturally arises: Are there other solvable 
cases for Pq(R, N) apart from the 1-d chain? In the next 
section we show that indeed the infinite range SK model 
is one such case where the distribution Pq (R, N) can be 
computed analytically. 



IV. THE SK MODEL 

In this section we calculate the infinite temperature 
distribution Pq(R,N) of the thermally averaged number 
of locally aligned spins in the infinite range SK model 
defined by the Hamiltonian in Eq. (22) where 
runs over all pairs of the total number of N sites. The 



couplings Jij's are independent of each other and we 
assume that each is drawn from a Gaussian distribu- 
tion, p(J) = ^N/2ire~ Nj2/2 . The choice J ~ 7V" 1/2 
is necessary to ensure that the free energy is extensive 
in the large N limit. It is clear from Eq. ( p6| ) that 
if we define ipi = z^j^i'Pi-j where each of the </>ij's 
are independent Gaussian variables with the distribution 
p{(/>) = ^/Nj2^e- N ^l 2 , then R = £jLi<?(^i)- It turns 
out that for technical reasons it is easier to consider the 
variable M = s S n (V'i) where sgn(x) = 29(x) — 1. 

Hence M = 2R—N. In what follows we will first compute 
the distribution Pq (M, N) and derive the corresponding 
distribution of R using the simple shift M = 2R — N . 

Since we are eventually interested in the limit N — > oo, 
M — > oo but keeping the ratio m = M/N fixed, we set 
M = mN and write 



N 



P {m,N) 



^sgn(V>i) - mN 



.4=1 



^ -iumN I M Y]^ :1 sgn(i/>i) 



2tt 



(28) 



where we have used the representation of the delta func- 
tion, S(x) — J x oo e l,J - x dp J /2TT and () . denotes the expec- 



tation over the distributions of Using the identity, 
(e^ ssn ^)> = Ea=-i.i eiA " T (% cr )) y > one can rewrite 
Eq. (E§) as 



' 71 {o-«=-l,l} \ i 



(29) 



We next use the representation, 9{x) = 

fo° dl J-ac^ eiX(x ~ l) in Ec l- ©> make the transfor- 
mation AjCi — * Xi and /i — > — /x and then sum over the 
Oi variables to obtain 



d/i 
2^ 



P (m,N) - 

x TT 2 cos(p + Xik) 



gi/xmAT x 



dh 



dXj 
2tt 



(30) 



The next step is to evaluate the the expecta- 



tion value (J^i e%Xi ^*)^- Using tpi 



n 



i(\i+Xj 



Us- 



we note that 

ing the Gaussian distribution p(<j>ij), one can eas- 
ily evaluate the expectation value to finally obtain, 



01, e**) 



exp 



Ey(A* + A;) 2 /4iV 



We next 



expand the sum, £\.(A 4 + Xj) 2 = 2N^ t A? + 2(£ ( A,) 2 
and use a Hubbard-Stratonovich transformation, 

ex P [-(E 4 A 2 ) 2 /27V] = ^f^dze^*-"* 2 /*, to 
finally write the expected value of the product as, 



J —OQ 

(31) 



We then substitute Eq. @ in Eq. @ and carry out 
the Gaussian integrations over the variables Zj's and Aj's 
to get 



P o{m , N ) = M T p e ^ N 
V 27T Zn 



dze- Nz2/2 [A{z,p)} 1 
(32) 



where the function A(z,p) is given by 



A{z,n) 



e^erfc [A= \ + e _i "erfc 



V2) 



(33) 



with erfc(a;) = -7= e u du being the complementary 
error function. 

We next expand the right hand side of Eq. (|3^) in a 
binomial series, substitute the resulting series in Eq. ( |32] ) 
and carry out the integration with respect to /1 to obtain, 
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— oo 
(l-m)AT/2 



dze Nz * /2 ^i^! n ) N S jx 



erfc(-z/V2 



(l+m)N/2 



■ (34) 



Keeping to fixed we then use the Stirling's formula to ap- 
proximate the combinatorial factor in Eq. ( |3^ ) for large 
N and then use the steepest descent method to evaluate 



the integral in Eq. (34) for large N. This gives, ignoring 
pre-exponential factors, a similar asymptotic behavior as 
in the 1-d case, Po(m,N) ~ e -Jve(m) wnere ^he i ar g e 
deviation function 0(m) in this case is given by, 



QOm) = — min 
2 z 



■=-i,i 



(1 — ma) log 



(1 — ma) 
2crfc(zCT/ % /2) 

(35) 



In terms of the original variable r = (1 + m)/2, 
the distribution is then given by Po (r, N) = Pq (to = 
2r - l,N)/2 ~ cxp[-iV$(r)] with $(r) = 0(2r - 1) 
where Q(x) is given exactly by Eq. (p5|). The function 
$(r) is symmetric around r = 1/2 since 0(m) in Eq. 
( |35| ) is symmetric around to = 0. As in the 1-d case, it 
seems difficult to obtain a closed form expression of $(r). 
However, it can be easily evaluated from Eq. (|35| ) using 
Mathematics as displayed by the dotted line in Fig.l . 
Moreover, similar to the 1-d case, one can evaluate $(r) 
analytically near r = 1/2 as well as near the tail regions 
r — > 0, 1. Omitting the details of algebra, we find, 



bv 

<5(r) = a + r log ( — 

e 



2tt 



,r -> 
.,r->l/2, 



(36) 



where a = log(2) + zg/2 - log [erfc (z /a/2~)] 
erfc (zo/y/2) /erfc (— z /^/2) and z is fV, ° 
equation, 



■20 



erfc (z /\/2) 



the root of the 
(37) 



Solving Eq. (p7|) numerically yields zq — —0.506054 . . . 
which gives a = 0.493919 ... and 6 = 2.26361 . . .. 

Thus the limiting behaviors of <&(r) near r = 1/2 
and r — > 0,1 in the SK model in Eq. (36) are qual- 
itatively similar to those in the 1-d case in Eq. (21). 
Using similar arguments as in Sec. I, one can under- 
stand the behavior near r = 1/2 as a consequence of 
a central limit theorem which predicts a Gaussian be- 
havior for Pq{R,N) ~ exp[-(i? - N/2) 2 /2a 2 N ], where 
a%_= ((R— N/2) 2 ) is the variance. Comparing with Eq. 
( pq ) we find that the variance for large N is given exactly 
by, a 2 N = (tt + 2)N/A%. This result for the variance can 
also be derived by a direct method as shown in the Ap- 
pendix, thus providing an additional consistency check. 



The results for the statistics of R in the 1-d case and in 
the SK model can be jointly summarized as: The mean 
is always given by, (R) — N/2 and the variance for large 
N is given by, 



rir = —N, 



4tt 



-N, 



1-d 
SK. 



(38) 



The behavior in the tail region near r — > (or 
equivalently near r — > 1) is also interesting. Let us, 
for simplicity, consider the r — > 1 limit where R = 
N. It is clear from the expression of Pq(N,N) in 
Eq. (Pq) that the delta function will contribute only 
when each of the 9 function inside the sum are sat- 
isfied, i.e., if FT 9 , ] = 1. Thus evidently 



Pa(N,N) = (lli0(£#i^Hj)) • But this quantity 
is just the average number of metastable configurations 
in the spin glass. To see this clearly, consider again 
the spin glass Hamiltonian in Eq. p2|). A spin con- 
figuration is called metastable if the energy required 
to flip any of the N spins is strictly positive. In 
other words, all the spins must be aligned to their lo- 
cal fields in a metastable configuration. Hence, for 
a fixed disorder, the fraction of metastable configura- 
tions (out of the total number of 2 N spin configura- 
tions) is given by, /(J) = 2~ N J2§ Yli 9{hiSi) where hiS 
are the local fields. Finally, the average (over disor- 
der) fraction of the metastable configurations is given 



by, (f(J)) j - If(J)Q 



J 



dJ. Using once again the 



gauge transformation for Ising spins, (f>i.j — JijSiSj one 
can easily express this average fraction as (/( J)) 7 = 
(Ui&(E j7 ti4>i,j)) = Pq(N,N). Our results on the 
large deviation function near r = 0, 1 in Eq. (^) in- 



dicates that P (N,N) = P o (0,N) 



for large N 



with a = 0.493919 . . .. On the other hand, the average 
number of metastable configurations in the SK model 
was computed long ago by Tanaka and Edwards J2£] and 
also by Bray and Moore |H| and this average is known 
to increase exponentially for large N as ~ e aN where 
a = 0.1992 pi] . Hence the average fraction scales as 
~ e aN /2 N = er cN with c = log 2- a = 0.4919. Thus the 
constant a in Eq. ( [36] ) is precisely the same as the con- 
stant c and hence the limiting behavior of our large devi- 
ation function near the tails r = 0, 1 is completely consis- 
tent with the calculation of average number of metastable 
states. 

Let us conclude this section with the following com- 
ment. In the case of the 1-d toy sequence, we found in 
Sec-II that the full occupation time distribution P(R, N) 
and consequently the associated large deviation function 
is completely independent of the distribution p(<f>). In the 
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case of the SK model, we have derived the large devaition 
function for a specific form of the disorder distribution, 
namely the Gaussian form. Naturally the question arises 
as to how universal is this large deviation function as one 
changes the disorder distribution. Evidently for finite N 
the results in the SK case, unlike the 1-d case, will de- 
pend on the details of the distribution p(J). However, 
due to the 1/yN scaling in the definition of the distri- 
bution of the Jij , the large N results inculding the large 
deviation are universal (upto rescaling by a constant fac- 
tor), provided the variance of the Jy's is finite. In the 
case of mean field spin glasses with power law or Levy 
interactions |2j|, the variance of the Jy is no longer fi- 
nite and it would be interesting to study the occupation 
time distribution in this context. 



V. SUMMARY AND CONCLUSION 

The three main points of this paper are: (i) we have 
been able to derive, for the first time, exact results for the 
occupation time distribution of a non-Markovian process. 
In our case, the stochastic process is not continuous in 
time, but rather a discrete toy sequence. Nevertheless 
this toy sequence retains the non-Markovian property 
which makes the results nontrivial. Besides the fact that 
exact results are always useful and instructive, this toy 
sequence also appears in various physical contexts such 
as diffusion process and spin glasses, thus extending the 
range of applications of our results, (ii) We also estab- 
lished an exact mapping of this sequence to an Ising spin 
glass chain using a gauge transformation. The occupa- 
tion time distribution in the sequence then translates, via 
this mapping, into the distribution of the thermally aver- 
aged number of spins that are aligned to their local fields 
in the spin glass chain at infinite temperature when all 
spin configurations are equally likely. This observation 
raises an interesting new question for any generic finite 
sized spin glass model: at a given temperature, what is 
the distribution of the thermally averaged number of lo- 
cally aligned spins? Our exact results in one dimension 
show that this distribution remains nontrivial even at in- 
finite temperature, (iii) We then were able to compute 
analytically this infinite temperature distribution in the 
SK model of spin glasses with Gaussian disorder and ar- 
gued that for very large N, the associated large deviation 
function is again universal, i.e., independent of the pre- 
cise form of the disorder distribution. 

We leave open the possibility of computing this distri- 
bution at a finite temperature for any spin glass model. 
For example, it would be interesting to know how this dis- 
tribution changes as one goes below the spin glass tran- 
sition temperature. 

The study of the number of metastable spins in various 
other spin glass models is an open question. We mention 
a few cases where exact results along these lines may 



be possible as the average number of metastable states 
is calculable: the SK model in the presence of external 
fields ^2|, p-spin spin glass models spin glasses on 
random graphs [p4|, mean field spin glasses with Levy 
interactions | p5[ , the Hopfield neural network model |26| ] 
and the Random Orthogonal Model [^Tj. The study of 
spin glass models on random graphs of fixed connectivity 
c are of particular interest as they interpolate between 
the one dimensional toy model studied here, at c = 2, 
and the SK model in the limit c — > oo. 



APPENDIX A: DIRECT CALCULATION OF THE 
VARIANCE OF M 

In this appendix we compute the variances of the oc- 
cupation time both in the one dimensional toy sequence 
and in the SK model by a more direct method. These 
results are identical to those obtained from the limiting 
forms of the large deviation functions near r = 1/2. 

We have in general, 



' N 



N 

T 



i^sgnOOsgnOj) 



N 

T ' 2 



(Al) 



where we have made use of the identities (0(ipi)) = 1/2 
for all i and sgn(x) = 20(x) — 1. 

In the one dimensional model only neighboring sites 
are correlated and hence one has 



N 



' N 



2^1 (0(^)0(^ + i))- - 



N 



2N{0(^ 1 )0{^ 2 )) 



(A2) 



where we have used the isotropy of the sites in the large 
N limit. One now has that 

d<p o d<pid<p 2 p(<l>o)p(<l>i)p(<p2)0(<l>o + (f>i)9(<t>i + <fa) 



#1/9(^0 / d0 2 p(0 2 ) (A3) 

1-4,! 



We introduce the function F(cf)) = J^°, d<p'p((j)') and use 
the relations p(<f>) — p{~4>) and dF/dcj) — p((j>) to carry 
out the integration and thus obtain 



(A4) 
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Putting this altogether gives the large N asymptotic re- 
sult aj^ = 5 AT/12. In fact using the generating function 
technique used in this paper, and hence taking into ac- 
count the boundary terms exactly, one can show that 
(7% = 5N/12 - 1/6 for any AT. Note that this result, for 
arbitrary AT, is independent of the precise form of the 
distribution p{4>). 

We now turn to the SK model where ipi = ^hj 
and the random variables 4>ij 's are independent and iden- 
tically distributed with the Gaussian distribution p(<f)) = 
^/Nj2^ e -N4> 2 /2. Clearly, one has (^ 2 ) = (1 - 1/AT) and 
(tpiipj) — 1/AT. We next use the well known identity that 
holds only for Gaussian random variables, 



(sgn(X)sgn(F)) = - sin" 1 f - 



Using this identity we get for i ^ j, 

2 _, ( 1 
(sgn(^)sgn(^)) = -sin (^7— j" 



This yields 



N 



N > 
= — + —N(N-l)t 
4 2ir 1 ; 



1 



AT- 1 



(A5) 



(A6) 



(A7) 



which gives the result a 2 N = N(ir + 2)/4ir in the limit of 
large AT. We note that the result in Eq. flA7|) for finite AT 



is valid only when the distribution of the <j>i t j is Gaussian. 
The finite AT result for arbitrary distribution of the (frij 
will depend in general on the details of the distribution 
and hence, in contrary to what happens in the one di- 
mensional toy model, will not be universal. However, as 
argued in Sec. IV, the large AT results including the re- 
sult for the variance, i.e., er 2 ^ = AT(7r + 2)/4tt is universal 
as long as the variance of the 4>i_j is finite. 
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